ClassTR: Classifying Within-Host Heterogeneity Based on Tandem Repeats with Application to Mycobacterium tuberculosis Infections

نویسندگان

  • Leonid Chindelevitch
  • Caroline Colijn
  • Prashini Moodley
  • Douglas Wilson
  • Ted Cohen
چکیده

Genomic tools have revealed genetically diverse pathogens within some hosts. Within-host pathogen diversity, which we refer to as "complex infection", is increasingly recognized as a determinant of treatment outcome for infections like tuberculosis. Complex infection arises through two mechanisms: within-host mutation (which results in clonal heterogeneity) and reinfection (which results in mixed infections). Estimates of the frequency of within-host mutation and reinfection in populations are critical for understanding the natural history of disease. These estimates influence projections of disease trends and effects of interventions. The genotyping technique MLVA (multiple loci variable-number tandem repeats analysis) can identify complex infections, but the current method to distinguish clonal heterogeneity from mixed infections is based on a rather simple rule. Here we describe ClassTR, a method which leverages MLVA information from isolates collected in a population to distinguish mixed infections from clonal heterogeneity. We formulate the resolution of complex infections into their constituent strains as an optimization problem, and show its NP-completeness. We solve it efficiently by using mixed integer linear programming and graph decomposition. Once the complex infections are resolved into their constituent strains, ClassTR probabilistically classifies isolates as clonally heterogeneous or mixed by using a model of tandem repeat evolution. We first compare ClassTR with the standard rule-based classification on 100 simulated datasets. ClassTR outperforms the standard method, improving classification accuracy from 48% to 80%. We then apply ClassTR to a sample of 436 strains collected from tuberculosis patients in a South African community, of which 92 had complex infections. We find that ClassTR assigns an alternate classification to 18 of the 92 complex infections, suggesting important differences in practice. By explicitly modeling tandem repeat evolution, ClassTR helps to improve our understanding of the mechanisms driving within-host diversity of pathogens like Mycobacterium tuberculosis.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Brucella melitensis and Mycobacterium tuberculosis depict overlapping gene expression patterns induced in infected THP-1 macrophages

Pathogens infecting mammalian cells have developed various strategies to suppress and evade their hosts’ defensive mechanisms. In this line, the intracellular bacteria that are able to survive and propagate within their host cells must have developed strategies to avert their host’s killing attitude. Studying the interface of host-pathogen confrontation can provide valuable information for defi...

متن کامل

Genotypic and phenotypic heterogeneity among Mycobacterium tuberculosis isolates from pulmonary tuberculosis patients.

Although the heterogeneity of Mycobacterium tuberculosis populations and the existence of mixed infections are now generally accepted, systematic studies on their relative importance are rare. In the present study, 10 individual colonies of each M. tuberculosis isolate (primary isolate) from 97 tuberculosis patients in a primarily human immunodeficiency virus-negative population were screened f...

متن کامل

Molecular epidemiology of disease due to Mycobacterium bovis in humans in the United Kingdom.

Mycobacterium bovis is the causative agent of bovine tuberculosis, with a wide host range. Fifty human M. bovis isolates were typed using spoligotyping and variable number tandem repeats (VNTR). Fifteen of these spoligotypes have not yet been recorded in cattle. The predominant spoligotype in humans and cattle was subdivided by VNTR.

متن کامل

Molecular typing of Mycobacterium tuberculosis by using nine novel variable-number tandem repeats across the Beijing family and low-copy-number IS6110 isolates.

Molecular epidemiological tools for genotyping clinical isolates of Mycobacterium tuberculosis have been developed and used to help track and contain transmission of tuberculosis. We identified 87 short sequence repeat loci within the genome of the M. tuberculosis H37Rv strain. Nine tandem repeats were found to be variable (variable-number tandem repeats [VNTRs]) in a set of 91 isolates. Fifty-...

متن کامل

Evaluation of MIRU-VNTR for typing of Mycobacterium bovis isolated from Sika deer in Northeast China

BACKGROUND Bovine tuberculosis has led to serious economic losses for Sika Deer producers in China. Strategies for controlling the spread of Mycobacterium bovis are often hampered by a lack of epidemiological data. Specifically, tracing infections requires the ability to trace back infections, which, in turn, requires the ability to determine isolates with a common source. This study was planne...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2016